Graph- and surface-level sentence chunking
نویسنده
چکیده
The computing cost of many NLP tasks increases faster than linearly with the length of the representation of a sentence. For parsing the representation is tokens, while for operations on syntax and semantics it will be more complex. In this paper we propose a new task of sentence chunking: splitting sentence representations into coherent substructures. Its aim is to make further processing of long sentences more tractable. We investigate this idea experimentally using the Dependency Minimal Recursion Semantics (DMRS) representation.
منابع مشابه
Oriya Multiword Chunking using Lexical knowledge base of verbs
The multiword chunking is otherwise thought of as the shallow parsing technique which identifies the multiword chunks and their interdependencies. The paper presents the proposed solution to the problem. Here we have designed the model of the proposed syntactic processor which uses lexical knowledge base of verbs for identifying intra chunk boundaries and finally forming the inter dependencies ...
متن کاملDiscourse Chunking and its Application to Sentence Compression
In this paper we consider the problem of analysing sentence-level discourse structure. We introduce discourse chunking (i.e., the identification of intra-sentential nucleus and satellite spans) as an alternative to full-scale discourse parsing. Our experiments show that the proposed modelling approach yields results comparable to state-of-the-art while exploiting knowledge-lean features and sma...
متن کاملNew Phrase Chunking Algorithm for Myanmar Natural Language Processing
Chunking is the subdivision of sentences into non recursive regular syntactical groups: verbal chunks, nominal chunks, adjective chunks, adverbial chunks and propositional chunks etc. The chunker can operate as a preprocessor for Natural Language Processing systems. This study aims to propose new phrase chunking algorithm for Myanmar natural language processing. The developed new algorithm acce...
متن کاملA Supervised Learning based Chunking in Thai using Categorial Grammar
One of the challenging problems in Thai NLP is to manage a problem on a syntactical analysis of a long sentence. This paper applies conditional random field and categorical grammar to develop a chunking method, which can group words into larger unit. Based on the experiment, we found the impressive results. We gain around 74.17% on sentence level chunking. Furthermore we got a more correct pars...
متن کاملComplete Syntactic Analysis Bases on Multi-level Chunking
This paper describes a complete syntactic analysis system based on multi-level chunking. On the basis of the correct sequences of Chinese words provided by CLP2010, the system firstly has a Part-ofspeech (POS) tagging with Conditional Random Fields (CRFs), and then does the base chunking and complex chunking with Maximum Entropy (ME), and finally generates a complete syntactic analysis tree. Th...
متن کامل